# Relative position encoding
Beit Base Patch16 224
Apache-2.0
BEiT is a Vision Transformer-based model pre-trained on ImageNet-21k through self-supervised learning and fine-tuned on ImageNet-1k for image classification tasks.
Image Classification
B
microsoft
58.34k
9
Beit Large Patch16 224
Apache-2.0
BEiT is an image classification model based on Vision Transformer (ViT) architecture, pretrained with self-supervised learning on ImageNet-21k and fine-tuned on ImageNet-1k.
Image Classification
B
microsoft
222.46k
1
Transfo Xl Wt103
Transformer-XL is a causal Transformer architecture that uses relative position encoding. It can capture longer context by reusing previously computed hidden states, making it suitable for text generation tasks.
Text Generation
Transformers English

T
transfo-xl
4,498
15
Featured Recommended AI Models